The Hotelling T-squared statistic has been widely used for testing differences in means for multivariate data. The existing 
statistic under classical statistics is applied when the observations in multivariate data are determined, precise, and exact. In practice, 
not all observations in the data are necessarily determined and precise, owing to measurement in complex situations and in an 
uncertain environment. In this paper, we introduce the Hotelling T-squared statistic under neutrosophic statistics (NS), 
which is the generalization of classical statistics and is applied in an uncertain environment. We discuss the application and 
advantage of the neutrosophic Hotelling T-squared statistic with the aid of data. From the comparison, we conclude that the 
proposed statistic is more adequate and effective under uncertainty. 


1. Introduction 


In classical statistics (CS), univariate analysis is the technique used to analyze single-variable data. Multivariate analysis has been widely used to analyze data having more than one variable. Among the multivariate techniques under CS, the Hotelling T-squared statistic has been widely applied in a variety of fields (see, for example, [1, 2]) for testing whether the means of more than one population are equal or not. This statistic is the extension of the t-test, which is applied for testing the mean of a single population. Brereton [3] used the Hotelling T-squared statistic to detect outliers in chemical data. In [4], Varmuza and Filzmoser worked on multivariate analysis for chemometric data. Hervé et al. [5] applied the multivariate technique to biological data. Kitagaki et al. [6] used the Hotelling T-squared statistic in chemical and electrochemical oscillator problems. For more details about the applications of the Hotelling T-squared statistic, the reader may refer to [3, 7, 8]. 

The Hotelling T-squared statistic derived under CS can only be applied when all observations in the multivariate data are determined, precise, and certain. In practice, the data under study are not always precise and may instead be linguistic. For example, the temperature of a certain city may be described as high, low, or medium, or the measurement of a variable in a complex system may fall in an interval rather than take a determined value. In such situations, the Hotelling T-squared statistic under CS cannot be used for the analysis of the data. When observations are uncertain or fuzzy, the fuzzy Hotelling T-squared statistic can be applied for testing the means of multivariate populations. Taleb et al. [9] applied the fuzzy Hotelling T-squared statistic to design a control chart. D’Urso [10] provided a review of fuzzy multivariate analysis. Bakdi and Kouadri [11] presented a new adaptive principal component analysis technique to detect faults in a complex system. In [12], Ammiche et al. introduced principal component analysis for the Tennessee Eastman process using a fuzzy approach. More applications can be found in [13-15]. 

Recently, neutrosophic logic, which is the extension of fuzzy logic, has attracted many researchers due to its applications in a variety of fields. Neutrosophic logic considers the measure of indeterminacy, which fuzzy logic does not consider (see [16]). Neutrosophic statistics (NS), which is based on neutrosophic numbers, is the generalization of CS (see [17, 18]). NS has been applied widely to rock-measuring problems (see, for example, [19, 20]). Applications of NS to product inspection can be seen in [21, 22]. Applications of NS in the area of process control can be seen in [23, 24]. An application of NS in medicine can be found in [25]. For more information on neutrosophic theory, the reader may refer to [26, 27]. 

Aslam and Smarandache [17, 18] offered suggestions for extending several concepts of CS to NS. 
To the best of our knowledge, after exploring the literature, there 
is no work on the development of the Hotelling T-squared 
statistic under NS. In this paper, we introduce the 
Hotelling T-squared statistic under NS, which is the generalization 
of classical statistics and is applied in an uncertain 
environment. We discuss the application and 
advantage of the neutrosophic Hotelling T-squared statistic with 
the aid of data. We expect that the proposed neutrosophic 
Hotelling T-squared statistic will perform better than the existing Hotelling T-squared statistic under uncertainty. 


2. Preliminaries 


Let $X_{jkN} \in [x_{jkL}, x_{jkU}]$ be a neutrosophic random variable that represents the neutrosophic observation of the $k$th variable recorded on the $j$th item. Note here that $X_{jkN} \in [x_{jkL}, x_{jkU}]$ is expressed as an indeterminacy interval having the smaller value $x_{jkL}$ and the larger value $x_{jkU}$. The neutrosophic form of $X_{jkN} \in [x_{jkL}, x_{jkU}]$, having determinate part $x_{jkL}$ and indeterminate part $x_{jkU} I_N$; $I_N \in [I_L, I_U]$, can be written as follows: $x_{jkN} = x_{jkL} + x_{jkU} I_N$; $I_N \in [I_L, I_U]$. Note here that the neutrosophic random variable reduces to the variable under classical statistics if no indeterminacy is recorded in the data. The neutrosophic data matrix having $n_N \in [n_L, n_U]$ neutrosophic observations of $p_N \in [p_L, p_U]$ neutrosophic variables is given as follows: 

$$X_N = \left[ \begin{pmatrix} x_{11L} & \cdots & x_{1kL} & \cdots & x_{1pL} \\ \vdots & & \vdots & & \vdots \\ x_{j1L} & \cdots & x_{jkL} & \cdots & x_{jpL} \\ \vdots & & \vdots & & \vdots \\ x_{n1L} & \cdots & x_{nkL} & \cdots & x_{npL} \end{pmatrix}, \begin{pmatrix} x_{11U} & \cdots & x_{1kU} & \cdots & x_{1pU} \\ \vdots & & \vdots & & \vdots \\ x_{j1U} & \cdots & x_{jkU} & \cdots & x_{jpU} \\ \vdots & & \vdots & & \vdots \\ x_{n1U} & \cdots & x_{nkU} & \cdots & x_{npU} \end{pmatrix} \right]; \quad X_N \in [X_L, X_U]. \tag{1}$$

The neutrosophic form of $X_N \in [X_L, X_U]$ can be written as 

$$X_N = X_L + X_U I_N; \quad I_N \in [I_L, I_U]. \tag{2}$$

Note here that $X_N \in [X_L, X_U]$ is the generalization of the data matrix under classical statistics. The data matrix under $X_N \in [X_L, X_U]$ reduces to the data matrix under classical statistics when $I_L = 0$. 

The neutrosophic sample mean and neutrosophic sample variance of $n_N$ measurements on $p_N$ neutrosophic variables are computed as follows: 

$$\bar{X}_{kN} = \frac{1}{n_N} \sum_{j=1}^{n_N} x_{jkN}; \quad \bar{X}_{kN} \in [\bar{X}_{kL}, \bar{X}_{kU}]. \tag{3}$$

The neutrosophic form of $\bar{X}_{kN} \in [\bar{X}_{kL}, \bar{X}_{kU}]$ can be written as 

$$\bar{X}_{kN} = \bar{X}_{kL} + \bar{X}_{kU} I_N; \quad I_N \in [I_L, I_U]. \tag{4}$$

Note here that $\bar{X}_{kN} \in [\bar{X}_{kL}, \bar{X}_{kU}]$ is the generalization of the sample mean under classical statistics; it reduces to the sample mean under classical statistics when $I_L = 0$. The neutrosophic sample variance is 

$$s_{kN}^2 = \frac{1}{n_N - 1} \sum_{j=1}^{n_N} (x_{jkN} - \bar{X}_{kN})^2; \quad s_{kN}^2 \in [s_{kL}^2, s_{kU}^2]. \tag{5}$$

The neutrosophic form of $s_{kN}^2 \in [s_{kL}^2, s_{kU}^2]$ can be written as 

$$s_{kN}^2 = s_{kL}^2 + s_{kU}^2 I_N; \quad I_N \in [I_L, I_U]. \tag{6}$$

Note here that $s_{kN}^2 \in [s_{kL}^2, s_{kU}^2]$ is the generalization of the sample variance under classical statistics; it reduces to the sample variance under classical statistics when $I_L = 0$. 

The neutrosophic sample covariance between two neutrosophic variables is given by 

$$s_{ikN} = \frac{1}{n_N - 1} \sum_{j=1}^{n_N} (x_{jiN} - \bar{X}_{iN})(x_{jkN} - \bar{X}_{kN}); \quad s_{ikN} \in [s_{ikL}, s_{ikU}]. \tag{7}$$

The neutrosophic form of $s_{ikN} \in [s_{ikL}, s_{ikU}]$ can be written as 

$$s_{ikN} = s_{ikL} + s_{ikU} I_N; \quad I_N \in [I_L, I_U]. \tag{8}$$
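The neutrosophic sample mean, variance, and covariance introduced in this section can be illustrated with a minimal Python sketch. It assumes that the lower and upper bounds of each statistic are obtained by applying the classical formula to the lower and upper endpoints of the observations separately, and that the divisor is n - 1. The function names and the toy data are hypothetical and are not taken from the paper.

```python
def split_bounds(data):
    """Split interval data into a lower-endpoint and an upper-endpoint data matrix."""
    lower = [[obs[0] for obs in row] for row in data]
    upper = [[obs[1] for obs in row] for row in data]
    return lower, upper


def mean(column):
    return sum(column) / len(column)


def covariance(col_i, col_k):
    """Classical sample covariance with divisor n - 1 (an assumption of this sketch)."""
    n = len(col_i)
    mi, mk = mean(col_i), mean(col_k)
    return sum((a - mi) * (b - mk) for a, b in zip(col_i, col_k)) / (n - 1)


def neutrosophic_stats(data):
    """Return interval-valued means and covariances as (lower, upper) pairs."""
    lower, upper = split_bounds(data)
    p = len(data[0])
    cols_l = [[row[k] for row in lower] for k in range(p)]
    cols_u = [[row[k] for row in upper] for k in range(p)]
    means = [(mean(cols_l[k]), mean(cols_u[k])) for k in range(p)]
    cov = [[(covariance(cols_l[i], cols_l[k]), covariance(cols_u[i], cols_u[k]))
            for k in range(p)] for i in range(p)]
    return means, cov


# Hypothetical toy data: 4 items, 2 variables, each observation an interval (lower, upper).
toy = [
    [(1.0, 1.0), (2.0, 2.5)],
    [(2.0, 2.5), (4.0, 4.0)],
    [(3.0, 3.0), (6.0, 6.5)],
    [(4.0, 4.5), (8.0, 8.0)],
]
means, cov = neutrosophic_stats(toy)
print(means)        # interval-valued sample means, one pair per variable
print(cov[0][0])    # interval-valued variance of the first variable
print(cov[0][1])    # interval-valued covariance between the two variables
```

The diagonal entries of `cov` play the role of the neutrosophic sample variances, so a separate variance function is not needed in this sketch.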
Note here that $s_{ikN} \in [s_{ikL}, s_{ikU}]$ is the generalization of the sample covariance under classical statistics; it reduces to the sample covariance under classical statistics when there are no indeterminate observations. 

Finally, the neutrosophic sample correlation between the $i$th and $k$th variables is given by 

$$r_{ikN} = \frac{s_{ikN}}{\sqrt{s_{iiN}} \sqrt{s_{kkN}}}; \quad r_{ikN} \in [r_{ikL}, r_{ikU}]. \tag{9}$$

The neutrosophic form of $r_{ikN} \in [r_{ikL}, r_{ikU}]$ can be written as 

$$r_{ikN} = r_{ikL} + r_{ikU} I_N; \quad I_N \in [I_L, I_U]. \tag{10}$$

Note here that $r_{ikN} \in [r_{ikL}, r_{ikU}]$ is the generalization of the sample correlation under classical statistics; it reduces to the sample correlation under classical statistics when there are no indeterminate observations. 

The neutrosophic descriptive statistics for $n_N$ measurements on $p_N$ variables can be presented in the following arrays. The neutrosophic sample mean, the neutrosophic sample variance and covariance, and the neutrosophic sample correlation are presented by the arrays $\bar{X}_N$, $S_N$, and $R_N$, respectively: 

$$\bar{X}_N \in \left[ \begin{pmatrix} \bar{X}_{1L} \\ \vdots \\ \bar{X}_{pL} \end{pmatrix}, \begin{pmatrix} \bar{X}_{1U} \\ \vdots \\ \bar{X}_{pU} \end{pmatrix} \right],$$

$$S_N \in \left[ \begin{pmatrix} s_{11L} & \cdots & s_{1kL} & \cdots & s_{1pL} \\ \vdots & & \vdots & & \vdots \\ s_{p1L} & \cdots & s_{pkL} & \cdots & s_{ppL} \end{pmatrix}, \begin{pmatrix} s_{11U} & \cdots & s_{1kU} & \cdots & s_{1pU} \\ \vdots & & \vdots & & \vdots \\ s_{p1U} & \cdots & s_{pkU} & \cdots & s_{ppU} \end{pmatrix} \right],$$

$$R_N \in \left[ \begin{pmatrix} 1 & \cdots & r_{1kL} & \cdots & r_{1pL} \\ \vdots & & \vdots & & \vdots \\ r_{p1L} & \cdots & r_{pkL} & \cdots & 1 \end{pmatrix}, \begin{pmatrix} 1 & \cdots & r_{1kU} & \cdots & r_{1pU} \\ \vdots & & \vdots & & \vdots \\ r_{p1U} & \cdots & r_{pkU} & \cdots & 1 \end{pmatrix} \right].$$


3. Neutrosophic Hotelling $T_N^2$ Statistic 


In this section, we discuss the proposed neutrosophic Hotelling $T_N^2$ statistic. In classical statistics, the Student t-test is applied for testing the mean in the univariate case. As mentioned by [28], rejecting the null hypothesis when $|t_N|$ is large is the same as rejecting the null hypothesis when its square is large: 

$$t_N^2 = \left[ \frac{(\bar{X}_L - \mu_{0L})^2}{s_L^2 / n_L}, \frac{(\bar{X}_U - \mu_{0U})^2}{s_U^2 / n_U} \right] = \left[ n_L (\bar{X}_L - \mu_{0L}) (s_L^2)^{-1} (\bar{X}_L - \mu_{0L}),\; n_U (\bar{X}_U - \mu_{0U}) (s_U^2)^{-1} (\bar{X}_U - \mu_{0U}) \right]; \quad t_N^2 \in [t_L^2, t_U^2]. \tag{11}$$

The neutrosophic form of $t_N^2 \in [t_L^2, t_U^2]$ can be written as 

$$t_N^2 = t_L^2 + t_U^2 I_N; \quad I_N \in [I_L, I_U]. \tag{12}$$

Note here that $t_N^2 \in [t_L^2, t_U^2]$ is the generalization of the squared t statistic under classical statistics; it reduces to the statistic under classical statistics when there are no indeterminate observations. 

For the given values of $\bar{X}_N \in [\bar{X}_L, \bar{X}_U]$ and $s_N^2 \in [s_L^2, s_U^2]$, the null hypothesis will be rejected if 

$$t_N^2 \in [t_L^2, t_U^2] > t_{n_N - 1}^2\left(\frac{\alpha}{2}\right), \tag{13}$$


where $\alpha$ is the level of significance and $t_{n_N - 1}(\alpha/2)$ is the upper $100(\alpha/2)$th percentile of the neutrosophic t-distribution with neutrosophic degrees of freedom $n_N - 1$. 
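As a hedged illustration of equations (11) and (13), the following Python sketch evaluates the univariate interval $[t_L^2, t_U^2]$ and compares it with a squared t critical value supplied by the caller. The function names are hypothetical. The three-way outcome, in which an interval that straddles the critical value is reported as indeterminate, is an assumption of this sketch and is not stated in the text. The inputs are the sweat rate summary values reported later in Section 4, borrowed purely for illustration, and 1.729 is an assumed tabulated value of the upper 5% point of the t-distribution with 19 degrees of freedom.

```python
def neutrosophic_t_squared(n, xbar, mu0, s2):
    """Interval [t2_L, t2_U] as in equation (11).

    xbar, mu0, and s2 are (lower, upper) pairs; n is assumed equal for both bounds.
    """
    lower = n * (xbar[0] - mu0[0]) ** 2 / s2[0]
    upper = n * (xbar[1] - mu0[1]) ** 2 / s2[1]
    return lower, upper


def decide(t2, critical):
    """Three-way decision (an assumption of this sketch, not the paper's rule)."""
    lo, hi = min(t2), max(t2)
    if lo > critical:
        return "reject"
    if hi <= critical:
        return "do not reject"
    return "indeterminate"


# Illustrative inputs: sweat rate mean and variance intervals from Section 4.
t2 = neutrosophic_t_squared(20, (4.64, 4.70), (4.0, 4.0), (2.8793, 2.8389))
critical = 1.729 ** 2  # assumed table value for alpha/2 = 0.05 and 19 degrees of freedom
print(t2, decide(t2, critical))
```

For these inputs the interval straddles the critical value, which is the kind of case that a single classical statistic cannot express.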

The generalization of equations (11) and (12) to the multivariate case under the neutrosophic statistical interval method (NSIM) is given by 

$$T_N^2 = \left[ n_L (\bar{X}_L - \mu_{0L})' (S_L)^{-1} (\bar{X}_L - \mu_{0L}),\; n_U (\bar{X}_U - \mu_{0U})' (S_U)^{-1} (\bar{X}_U - \mu_{0U}) \right]; \quad T_N^2 \in [T_L^2, T_U^2]. \tag{14}$$

The neutrosophic form of $T_N^2 \in [T_L^2, T_U^2]$ can be written as 

$$T_N^2 = T_L^2 + T_U^2 I_N; \quad I_N \in [I_L, I_U]. \tag{15}$$

The statistic given in equation (14) is called the neutrosophic Hotelling $T_N^2$ statistic. It is distributed as a scaled neutrosophic F-distribution with neutrosophic degrees of freedom (ndf) $p_N$ and $(n_N - p_N)$: 

$$T_N^2 \sim \frac{(n_N - 1) p_N}{(n_N - p_N)} F_{p_N, n_N - p_N}; \quad T_N^2 \in [T_L^2, T_U^2]. \tag{16}$$

The neutrosophic Hotelling $T_N^2$ statistic can be used for testing the null hypothesis $H_{0N}: \mu_N = \mu_{0N}$ against the alternative hypothesis $H_{1N}: \mu_N \neq \mu_{0N}$. The null hypothesis $H_{0N}: \mu_N = \mu_{0N}$ will be rejected if 

$$T_N^2 \in [T_L^2, T_U^2] > \frac{(n_N - 1) p_N}{(n_N - p_N)} F_{p_N, n_N - p_N}(\alpha). \tag{17}$$

The software provides the p value for making a decision about the acceptance or the rejection of the null hypothesis. According to [18], “a neutrosophic p value is defined in the same way as in classical statistics: the smallest level of significance at which a null hypothesis $H_0$ can be rejected.” Note here that the neutrosophic p value is not an exact or determined value as in the case of classical statistics. Smarandache [18] discussed criteria to accept or reject the null hypothesis using the neutrosophic p value. 
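The quadratic form in equation (14) and the threshold in equation (17) can be sketched in plain Python as follows. This is only an illustration under stated assumptions: the function names are hypothetical, the linear system is solved by a small Gaussian elimination routine rather than a library call, and the F quantile is passed in by the caller. The inputs are the lower-endpoint summary values reported in Section 4, and 2.44 is an assumed tabulated value of $F_{3,17}(0.10)$ that does not appear in the text.

```python
def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting (small dense systems)."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix, copied
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def hotelling_t2(n, d, s):
    """n * d' S^{-1} d for one endpoint (lower or upper) of equation (14)."""
    y = solve(s, d)  # y = S^{-1} d, computed without forming the inverse
    return n * sum(di * yi for di, yi in zip(d, y))


def critical_value(n, p, f_quantile):
    """Threshold of equation (17); f_quantile stands for F_{p, n-p}(alpha)."""
    return (n - 1) * p / (n - p) * f_quantile


# Lower-endpoint summary values as reported in Section 4 (n = 20, p = 3).
d_lower = [0.64, -4.6, -0.035]
s_lower = [
    [2.8793, 10.01, -1.8090],
    [10.01, 199.7884, -5.64],
    [-1.8090, -5.64, 3.6276],
]
t2_lower = hotelling_t2(20, d_lower, s_lower)
crit = critical_value(20, 3, 2.44)  # 2.44 is an assumed table value of F_{3,17}(0.10)
print(round(t2_lower, 4), round(crit, 2), t2_lower > crit)
```

The same two functions can be applied to the upper-endpoint values to obtain the other bound of the interval. With the rounded table value, the threshold comes out close to the value of 8.17 reported in Section 4.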


4. Application 


Now, we discuss the application of the proposed neutrosophic Hotelling $T_N^2$ statistic using data selected from the healthcare department. The data are collected from 20 healthy women, and three variables, namely sweat rate, sodium content, and potassium content, are measured. The observations of the variables under investigation are obtained from a measurement process. It is expected that not all observations in the data are precise and exact; therefore, the data cannot be analyzed using CS. Similar data for classical statistics are given by [28]. The data having some neutrosophic observations are shown in Table 1. We want to test whether the population means of the three variables for the healthy women equal the hypothesized values. We state the null and alternative hypotheses and carry out the test as follows: 

TABLE 1: The neutrosophic sweat data. 

Individual   X1 (sweat rate)   X2 (sodium)     X3 (potassium) 
1            [3.7, 3.7]        [48.5, 48.7]    [9.3, 9.3] 
2            [5.7, 5.8]        [65.1, 65.1]    [8.0, 8.1] 
3            [3.8, 3.8]        [47.2, 47.3]    [10.9, 10.9] 
4            [3.2, 3.3]        [53.2, 53.3]    [12.0, 12.0] 
5            [3.1, 3.1]        [55.5, 55.5]    [9.7, 9.8] 
6            [4.6, 4.8]        [36.1, 36.2]    [7.9, 7.9] 
7            [2.4, 2.4]        [24.8, 24.8]    [14.0, 14.0] 
8            [7.2, 7.3]        [33.1, 33.2]    [7.6, 7.7] 
9            [6.7, 6.7]        [47.4, 47.4]    [8.5, 8.6] 
10           [5.4, 5.5]        [54.1, 54.2]    [11.3, 11.3] 
11           [3.9, 3.9]        [36.9, 36.9]    [12.7, 12.7] 
12           [4.5, 4.6]        [58.8, 58.9]    [12.3, 12.4] 
13           [3.5, 3.5]        [27.8, 27.9]    [9.8, 9.8] 
14           [4.5, 4.5]        [40.2, 40.2]    [8.4, 8.5] 
15           [1.5, 1.7]        [13.5, 13.5]    [10.1, 10.2] 
16           [8.5, 8.5]        [56.4, 56.5]    [7.1, 7.1] 
17           [4.5, 4.7]        [71.6, 71.9]    [8.2, 8.2] 
18           [6.5, 6.5]        [52.8, 52.8]    [10.9, 10.9] 
19           [4.1, 4.2]        [44.1, 44.2]    [11.2, 11.3] 
20           [5.5, 5.5]        [40.9, 40.9]    [9.4, 9.5] 

Step 1: $H_{0N}: \mu_{0N} = \begin{pmatrix} [4, 4] \\ [50, 50] \\ [10, 10] \end{pmatrix}$ vs. $H_{1N}: \mu_N \neq \begin{pmatrix} [4, 4] \\ [50, 50] \\ [10, 10] \end{pmatrix}$. 

Step 2: some basic calculations for the data given in Table 1 are shown as 

$$\bar{X}_N = \begin{pmatrix} [4.64, 4.70] \\ [45.4, 45.47] \\ [9.965, 10.01] \end{pmatrix}, \quad \mu_{0N} = \begin{pmatrix} [4, 4] \\ [50, 50] \\ [10, 10] \end{pmatrix}, \quad D_N = (\bar{X}_N - \mu_{0N}) = \begin{pmatrix} [0.64, 0.70] \\ [-4.6, -4.53] \\ [-0.035, 0.01] \end{pmatrix},$$

$$S_N = \begin{pmatrix} [2.8793, 2.8389] & [10.01, 10.04] & [-1.8090, -1.8284] \\ & [199.7884, 200.71] & [-5.64, -5.72] \\ & & [3.6276, 3.5830] \end{pmatrix}. \tag{18}$$

Step 3: let $\alpha = [0.10, 0.10]$ be the level of significance. 

Step 4: the neutrosophic Hotelling $T_N^2$ statistic is 

$$T_N^2 = \left[ n_L (D_L)' (S_L)^{-1} (D_L),\; n_U (D_U)' (S_U)^{-1} (D_U) \right],$$

$$T_N^2 = 20 \begin{pmatrix} [0.64, 0.64] & [-4.6, 4.6] & [-0.035, -0.035] \end{pmatrix} \begin{pmatrix} [2.8793, 2.8389] & [10.01, 10.04] & [-1.8090, -1.8284] \\ & [199.7884, 200.71] & [-5.64, -5.72] \\ & & [3.6276, 3.5830] \end{pmatrix}^{-1} \begin{pmatrix} [0.64, 0.64] \\ [-4.6, 4.6] \\ [-0.035, -0.035] \end{pmatrix} = [9.7387, 11.4176]. \tag{19}$$

Step 5: the critical value using equation (17) is given as 

$$T_{0N}^2 = \frac{(20 - 1) 3}{(20 - 3)} F_{0.10, 3, 17} = [8.17, 8.17].$$

Step 6: as $T_N^2 = [9.7387, 11.4176] > T_{0N}^2 = [8.17, 8.17]$, we reject $H_{0N}: \mu_{0N} = \begin{pmatrix} [4, 4] \\ [50, 50] \\ [10, 10] \end{pmatrix}$. 


Journal of Analytical Methods in Chemistry 


5. Comparisons 


In Section 4, we presented the testing procedure for the proposed neutrosophic Hotelling $T_N^2$. The proposed neutrosophic Hotelling $T_N^2$ is the generalization of the Hotelling T-squared statistic under CS. The proposed testing procedure reduces to the testing procedure under CS when all observations of the sweat data are precise. From the neutrosophic sweat data, we note that the proposed testing procedure provides the analysis values in an indeterminacy interval rather than as determined values. The neutrosophic form of the proposed Hotelling statistic is $T_N^2 = 9.7387 + 11.4176 I_N$; $I_N \in [0, 0.1470]$. For example, the proposed Hotelling statistic has the indeterminacy interval from 9.73 to 11.41. It means that, in an uncertain environment, one can expect the values of $T_N^2$ to range from 9.73 to 11.41. The first value 9.73 of the indeterminacy interval of $T_N^2$ shows the determined part, and 11.41 is the indeterminate part. When no imprecise observations are recorded in the sweat data, the value of $T_N^2$ is 9.73, which is the value under CS. In other words, when the level of significance is 5%, the probabilities that the null hypothesis is accepted, rejected, and indeterminate are 0.95, 0.05, and 0.1470. By comparing the proposed test with the test under CS, we note that the existing test is unable to give any information about the probability of indeterminacy. As mentioned in [19, 20], a method that provides values in an indeterminacy interval under uncertainty is considered more effective and adequate. By comparing the proposed testing procedure with the existing one under CS, we find that our results agree with the theory in [19, 20]. 
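The relation between the indeterminacy interval and the neutrosophic form quoted in this section can be checked with a short Python sketch. It assumes that the measure of indeterminacy is bounded above by $I_U = (T_U^2 - T_L^2)/T_U^2$, a rule inferred here from the reported values 9.7387, 11.4176, and 0.1470 rather than stated explicitly in the text. The function names are hypothetical.

```python
def indeterminacy_upper(lower, upper):
    """Largest I for which lower + upper * I stays within [lower, upper] (assumed rule)."""
    return (upper - lower) / upper


def neutrosophic_value(lower, upper, i):
    """Evaluate the neutrosophic form lower + upper * I at a given indeterminacy I."""
    return lower + upper * i


t2_l, t2_u = 9.7387, 11.4176            # interval reported for the sweat data
i_u = indeterminacy_upper(t2_l, t2_u)   # close to the reported 0.1470
print(round(i_u, 4))
print(neutrosophic_value(t2_l, t2_u, 0.0))   # determinate part (classical value)
print(neutrosophic_value(t2_l, t2_u, i_u))   # upper end of the interval
```

Setting the indeterminacy to zero recovers the lower endpoint, which matches the statement that the procedure reduces to the one under CS when no observation is imprecise.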


6. Concluding Remarks 


In this paper, we introduced the Hotelling T-squared statistic under neutrosophic statistics (NS), which is the generalization of classical statistics and is applied in an uncertain environment. We discussed the application and advantage of the neutrosophic Hotelling T-squared statistic with the aid of data. The proposed neutrosophic Hotelling T-squared statistic is expressed as an indeterminacy interval and is hence more flexible and informative than the Hotelling T-squared statistic under classical statistics. Based on the comparison, we recommend using the proposed neutrosophic Hotelling T-squared statistic for the analysis of data under uncertainty. Further properties of the proposed neutrosophic Hotelling T-squared statistic can be studied in future research, as can the sensitivity of the proposed statistic to uncertainty and measurement errors. 